Data provenance for preservation of digital geoscience data

Authors

  • Beth Plale
  • Bin Cao
  • Chathura Herath
  • Yiming Sun

Abstract

A necessary first step in the preservation of digital scientific data is gathering enough information “about” a scientific outcome or data collection that it can be discovered and used a decade from now as easily as it is reused next week. Data provenance, or the lineage of a collection, can capture how a particular scientific collection was created, when, and by whom. Our goal is to devise tools that automate the collection of provenance, so that this task does not fall onto the researcher, and to efficiently store and represent the provenance data that makes the data more amenable to long-term preservation. We demonstrate through application to several projects that automated provenance collection can reach the necessary level of provenance, but challenges remain in addressing provenance collection in a non-workflow setting and in data preservation in cyberinfrastructure architectures.

Similar resources

Secure Provenance for Data Preservation Repositories

The importance of research data preservation and management has been accepted by scientists all around the world. Interest and investment in data preservation projects are higher than ever before. There are already a number of well-known research data repositories for different types of research data. Data preservation, sharing, discovery and reuse are the key features which are common acro...

Authenticity and Provenance in Long Term Digital Preservation: Modeling and Implementation in Preservation Aware Storage

A growing number of digital objects are designated for long-term preservation, a time scale during which technologies, formats and communities are very likely to change. Specialized approaches, models and technologies are needed to guarantee the long-term understandability of the preserved data. Maintaining the authenticity (trustworthiness) and provenance (history of creation, ownership, accesse...

Digital Preservation in Data-Driven Science: On the Importance of Process Capture, Preservation and Validation

Current digital preservation is strongly biased towards data objects: digital files of document-style objects, or encapsulated and largely self-contained objects. To provide authenticity and provenance information, comprehensive metadata models are deployed to document information on an object’s context. Yet, we claim that simply documenting an object’s context may not be sufficient to ensure pr...

Digital Archives and Preservation

DEFINITION Preservation is the set of processes that maintain the authenticity, integrity, and chain of custody of records for long periods of time. Authenticity is defined as the management of provenance information about the creation of the record. Integrity is defined as the ability to create an authentic copy. Chain of custody tracks all processing done to the record, including migration to...

Provenance Description of Metadata using PROV with PREMIS for Long-term Use of Metadata

Provenance description is necessary for long-term preservation of digital resources. Open Archival Information System (OAIS) and Preservation Metadata: Implementation Strategies (PREMIS), which are well-known standards designed for digital preservation, define descriptive elements for digital preservation. Metadata has to be preserved as well as primary resource in order to keep the primary res...

Publication date: 2009